Accession Number |
TCMCG075C19793 |
gbkey |
CDS |
Protein Id |
XP_017979352.1 |
Location |
join(21040387..21040587,21040710..21040883,21041036..21041135,21041449..21041925,21042713..21043435,21044058..21044149,21044599..21044672,21044845..21045067,21045258..21045351,21045501..21045655,21045964..21046188,21046739..21046939,21047397..21047522,21047600..21047661,21048035..21048152,21048436..21048720,21048808..21048846) |
Gene |
LOC18596361 |
GeneID |
18596361 |
Organism |
Theobroma cacao |
Length |
1122 aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_018123863.1 |
Definition |
PREDICTED: peroxisome biogenesis protein 1 isoform X1 [Theobroma cacao] |
CDS: ATGGAGTTTGAGGTGAGACACGTGGCAGGAATAGAGGACTGCTTCGTATCTCTTCCACTCCTACTCATCCAAACCCTTCAATCCACGCGCTCTTCTCTCCTCCCTCCCCTTCTCGCTCTCGAGCTTCGCCTCCCACGCTCCTCCGACCACCCCTGGATCGTCGCTTGGTCCGGCGCTGCTTCTTCTTCCACTGCTATTGAGGTTTCTCAACAATTTGCAGAATGTATATCTTTGCCCAATCACACCACAGTTCAAGTACGAGCAGCTTCTAATATGGCAAAGGCTACATTAGTCACAATTGAACCTCATACCGAGGATGATTGGGAAGTTTTAGAGCTTAACTCTGAGCACGCAGAAGCTGCTATATTAAAGCAGGTCAGGATTGTCCATGAAGGAATGCGATTTCCTCTGTGGTTGCATGGCCGCACGATCGTAACTTTCCTAGTGGTTTCAACCTTTCCCAAGAAAGCGGTGGTTCAACTTGTCCCTGGAACAGAAGTTGCTGTTGCTCCAAAGAGACGTGAGAAAAATTTAAACAACATGGAATCGTCTACCAGAGAATCTCATGGTGCAAAAGCACTGCTACGTTTGCAAGATTCGGACAGAAGATTGTTTCACAAAAGCAATGTCAAAGGTGTTGAGCTTGGGGTAGCACTCACTTCTGTCGCCTTTATTCATCAAGTAACAGCTAAAAGATTTTCATTGGAGTCTCTTCAGTTGGTTGTTATAGTGCCAAGATTGTCATCCAAAGGGAGTGTGAAGAATCTGGAAAATGATGCCTTGAGAATGAAAGGAAGTTTAACTTCCAAGGAAGTAAATAGTGGAATTTCAACTGATAATAAGGAATTTCGTCAAGTGATTGTTCACCTTTTAATTTCAGATTCAGTGGCTGAAGGACATGTAATGATTACTCGCTCTCTTCGGCTTTATTTGAGAGCAGGACTACATTCATGGGTTTATTTAAAGGGCTATAATGTTGCTTTGAAGAAGGAAATTTCTGTACTGTCACTTTCTCCATGCCACTTCAAGATGGTTGCAAATGATAAGGAGAATGGTCTTGAAGTGCTTGATGGCCATAAAACTCGTAGGATGAAAAACTCTGGTTCAGGAACCTCTTTAGAGGTAGTAAATTGGTCAACCCATGATGATGTTGTGGCTGTTCTTTCTTCTGAATTTCCTTTCCAAGAGGCTGAAGACTCCAGTCAGGAAGACACTAAAAAGGGCTTAGAATGTCTTCTTCGTGCATGGTTTCTTGCTCAACTTGATGCTATAGCTTCAAATGCAGGGACGGAAGTTAAGACATTGGTTTTGGGGAATGAAAATCTACTTCACTTTGAGGTGAACAGATATGATTCTGGGACTTACGGACTAGTCTCATCTAATGGTTTTTCAGAAAAGAGAAATAAGACTAAGGACTTGCCGGTGGAAATTTCATACATATTGACCATTTCTGAGGAACTACTGCACAGTGGAAATGTTAATGCGTATGAGCTTGCCCTTGATGATAGAAACAAGAGGAATGATGTTCAGGGCGGTTTCGAGTTGTTTGGAAAGCTAAATTTGGGTAACCCTATGTCCCTATATTCTGTTAAAGACAGAACATCTGTCAAGGGGTTTAGCACAAATGCATCTTCATTAAGCTGGATGGGTGTGACTGCTTCTGATGTTATCAATAGAATGATGGTGTTGTTAGCTCCTGCTTCTGGAATTTGGTTTAGTACTTACAATCTTCCTCTCCCAGGACATGTTCTAATATATGGACCTGCGGGTTCTGGAAAGACATTATTGGCTAGAGCTGTTGCAAAGTCCCTTGAAGAACATAAAGACCTGTTAGCACATGTAATCTTTATATGTTGCTCAGGGCTTGCTTTAGAGAAGCCCCCAACCATTCGTCAAGCGCTTTCAAGTTTTGTGTCTGAAGCTCTAGATCATGCACCTTCAGTTGTTGTTTTTGATGATCTTGATAGTATCATCCAATCTTCATCTGACTCAGAA
GGATCCCAACCTTCAACCTCAGTTGTTGCACTTACTAAATTTCTCACTGACATTATTGATGAATATGGAGAAAAGAGGAAGAGCTCCTGTGGTATTGGTCCAATAGCTTTTATAGCTTCTGTGCAGTCTCTGGAGAGTATCCCTCAGTCTTTGAGCTCATCAGGAAGGTTTGACTTTCATGTGCAACTACCTGCACCTGCTGCCTCTGAACGTGGGGCCATATTGAAGCATGAAATTCAGAGGCGTTCCCTACAATGTCATGATGACATCTTACTTGATGTAGCTTCCAAATGTGATGGATATGATGCATATGATCTGGAAATATTGGTTGATAGAGCTGTTCATGCCGCCATTGGTCGGTTTTTGCCTTCTGATTCTGAAGAATACGTGAAGCCCATTTTAGTTAGGGAGGATTTCTCTCATGCTATGCATGAGTTCCTTCCAGTTGCCATGCGTGACATTACTAAATCTGCTCCTGAAGTTGGTCGCTCTGGTTGGGATGATGTTGGTGGTCTCAATGACATTCGAGATGCTATCAAAGAGATGATTGAAATGCCTTCAAAGTTTCCGAATATATTTGCACAAGCTCCTTTAAGGTTGCGGTCTAATGTTCTCTTATATGGTCCTCCTGGCTGTGGTAAAACCCACATTGTTGGTGCTGCTGCTGCCGCTTGTTCACTAAGATTTATATCGGTGAAAGGGCCTGAGCTACTGAACAAATACATTGGTGCTTCTGAGCAAGCTGTTCGAGATATTTTTTCAAAGGCAGCTGCTGCAGCGCCATGCCTCCTCTTTTTTGATGAATTTGATTCCATTGCACCTAAAAGAGGGCATGACAACACTGGAGTAACTGATAGAGTTGTTAATCAATTCCTAACAGAATTAGATGGCGTTGAAGTTTTGACTGGTGTATTTGTGTTTGCTGCAACAAGTAGACCAGATCTGCTTGATGCTGCATTGCTGAGACCAGGTAGGCTCGATCGCCTCCTTTTCTGTGATTTTCCATCTCGGCGTGAGAGGTTGGATGTTCTGACTGTTCTTTCTAGAAAGCTACCATTAGCCAGTGATGTTGATTTAGGCGCCATAGCTTGTATGACAGAAGGATTTAGCGGAGCTGATCTCCAAGCTCTTCTCTCAGACGCACAGCTTGCTGCAGTTCATGAACATTTGAGCAGTGTGAGTAGCAATGAGCCTGGAAAAATGCCAGTCATAACTGATGGTGTTTTGAAGTCTATTGCTTCAAAGGCAAGACCATCAGTTTCAGAAACCGAGAAGCAGAGACTTTATGGCATCTACAGTCAGTTTCTGGATTCAAAGAGATCCGTTGCTGCACAGTCAAGGGATGCAAAAGGCAAGAGGGCAACTCTGGCATGA |
Protein: MEFEVRHVAGIEDCFVSLPLLLIQTLQSTRSSLLPPLLALELRLPRSSDHPWIVAWSGAASSSTAIEVSQQFAECISLPNHTTVQVRAASNMAKATLVTIEPHTEDDWEVLELNSEHAEAAILKQVRIVHEGMRFPLWLHGRTIVTFLVVSTFPKKAVVQLVPGTEVAVAPKRREKNLNNMESSTRESHGAKALLRLQDSDRRLFHKSNVKGVELGVALTSVAFIHQVTAKRFSLESLQLVVIVPRLSSKGSVKNLENDALRMKGSLTSKEVNSGISTDNKEFRQVIVHLLISDSVAEGHVMITRSLRLYLRAGLHSWVYLKGYNVALKKEISVLSLSPCHFKMVANDKENGLEVLDGHKTRRMKNSGSGTSLEVVNWSTHDDVVAVLSSEFPFQEAEDSSQEDTKKGLECLLRAWFLAQLDAIASNAGTEVKTLVLGNENLLHFEVNRYDSGTYGLVSSNGFSEKRNKTKDLPVEISYILTISEELLHSGNVNAYELALDDRNKRNDVQGGFELFGKLNLGNPMSLYSVKDRTSVKGFSTNASSLSWMGVTASDVINRMMVLLAPASGIWFSTYNLPLPGHVLIYGPAGSGKTLLARAVAKSLEEHKDLLAHVIFICCSGLALEKPPTIRQALSSFVSEALDHAPSVVVFDDLDSIIQSSSDSEGSQPSTSVVALTKFLTDIIDEYGEKRKSSCGIGPIAFIASVQSLESIPQSLSSSGRFDFHVQLPAPAASERGAILKHEIQRRSLQCHDDILLDVASKCDGYDAYDLEILVDRAVHAAIGRFLPSDSEEYVKPILVREDFSHAMHEFLPVAMRDITKSAPEVGRSGWDDVGGLNDIRDAIKEMIEMPSKFPNIFAQAPLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLNKYIGASEQAVRDIFSKAAAAAPCLLFFDEFDSIAPKRGHDNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRLLFCDFPSRRERLDVLTVLSRKLPLASDVDLGAIACMTEGFSGADLQALLSDAQLAAVHEHLSSVSSNEPGKMPVITDGVLKSIASKARPSVSETEKQRLYGIYSQFLDSKRSVAAQSRDAKGKRATLA |
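The Location and Length fields of this record can be cross-checked against each other. The Python sketch below is not part of the record. It assumes the coordinates are 1-based and inclusive (GenBank-style) and that the last codon of the CDS is a stop codon (the CDS shown above ends in TGA). The helper names parse_join and cds_length are illustrative only.

```python
import re

# Location string copied from the record's Location field.
LOCATION = (
    "join(21040387..21040587,21040710..21040883,21041036..21041135,"
    "21041449..21041925,21042713..21043435,21044058..21044149,"
    "21044599..21044672,21044845..21045067,21045258..21045351,"
    "21045501..21045655,21045964..21046188,21046739..21046939,"
    "21047397..21047522,21047600..21047661,21048035..21048152,"
    "21048436..21048720,21048808..21048846)"
)


def parse_join(location):
    """Return a list of (start, end) pairs from a join(...) location string.

    Assumes plain 'start..end' segments with no complement() or
    partial-end markers ('<', '>'), which is the case for this record.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def cds_length(segments):
    """Total nucleotide length of the segments (coordinates are inclusive)."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(LOCATION)
total_nt = cds_length(segments)
codons, remainder = divmod(total_nt, 3)

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
protein_length = codons - 1

print(f"segments: {len(segments)}")
print(f"CDS length: {total_nt} nt, remainder mod 3: {remainder}")
print(f"implied protein length: {protein_length} aa")
```

Under these assumptions the 17 segments sum to 3369 nt, that is 1123 codons including the stop, which agrees with the Length field of 1122 aa.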